NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Learning From Demonstrations: A Computationally Efficient Inverse Reinforcement Learning Approach With Simplified Implementation

https://doi.org/10.1109/TETCI.2025.3526502

Lin, Yanbin; Ni, Zhen; Zhong, Xiangnan (January 2025, IEEE Transactions on Emerging Topics in Computational Intelligence)

Free, publicly-accessible full text available January 20, 2026
Adaptable and Reliable Text Classification using Large Language Models

https://doi.org/10.1109/ICDMW65004.2024.00015

Wang, Zhiqiang; Pang, Yiran; Lin, Yanbin; Zhu, Xingquan (December 2024, 2024 IEEE International Conference on Data Mining Workshops (ICDMW))

Full Text Available
Transfer Contrastive Learning for Raman Spectroscopy Skin Cancer Tissue Classification

https://doi.org/10.1109/JBHI.2024.3451950

Wang, Zhiqiang; Lin, Yanbin; Terentis, Andrew C; Strasswimmer, John; Zhu, Xingquan (December 2024, IEEE Journal of Biomedical and Health Informatics)

Full Text Available
An Imitation Learning Method with Multi-Virtual Agents for Microgrid Energy Optimization under Interrupted Periods

https://doi.org/10.1109/PESGM51994.2024.10689197

Lin, Yanbin; Ni, Zhen; Tang, Yufei (July 2024, IEEE)

Existing computer analytic methods for the microgrid system, such as reinforcement learning (RL) methods, suffer from a long-term problem with the empirical assumption of the reward function. To alleviate this limitation, we propose a multi-virtual-agent imitation learning (MAIL) approach to learn the dispatch policy under different power supply interrupted periods. Specifically, we utilize the idea of generative adversarial imitation learning method to do direct policy mapping, instead of learning from manually designed reward functions. Multi-virtual agents are used for exploring the relationship of uncertainties and corresponding actions in different microgrid environments in parallel. With the help of a deep neural network, the proposed MAIL approach can enhance robust ability by minimizing the maximum crossover discriminators to cover more interrupted cases. Case studies show that the proposed MAIL approach can learn the dispatch policies as well as the expert method and outperform other existing RL methods.
more » « less
Full Text Available
LLMGeo: Benchmarking Large Language Models on Image Geolocation In-the-wild

Wang, Zhiqiang; Xu, Dejia; Khan, Rana Muhammad; Lin, Yanbin; Fan, Zhiwen; Zhu, Xingquan (June 2024, The 3rd CVPR Workshop on Computer Vision in the Wild)

Full Text Available
A Modified Maximum Entropy Inverse Reinforcement Learning Approach for Microgrid Energy Scheduling

https://doi.org/10.1109/PESGM52003.2023.10252933

Lin, Yanbin; Das, Avijit; Ni, Zhen (July 2023, IEEE Power Energy Society General Meeting)

Full Text Available
Multi-Virtual-Agent Reinforcement Learning for a Stochastic Predator-Prey Grid Environment

https://doi.org/10.1109/IJCNN55064.2022.9891898

Lin, Yanbin; Ni, Zhen; Zhong, Xiangnan (September 2022, 2022 International Joint Conference on Neural Networks (IJCNN))

Generalization problem of reinforcement learning is crucial especially for dynamic environments. Conventional reinforcement learning methods solve the problems with some ideal assumptions and are difficult to be applied in dynamic environments directly. In this paper, we propose a new multi-virtual- agent reinforcement learning (MVARL) approach for a predator-prey grid game. The designed method can find the optimal solution even when the predator moves. Specifically, we design virtual agents to interact with simulated changing environments in parallel instead of using actual agents. Moreover, a global agent learns information from these virtual agents and interacts with the actual environment at the same time. This method can not only effectively improve the generalization performance of reinforcement learning in dynamic environments, but also reduce the overall computational cost. Two simulation studies are considered in this paper to validate the effectiveness of the designed method. We also compare the results with the conventional reinforcement learning methods. The results indicate that our proposed method can improve the robustness of reinforcement learning method and contribute to the generalization to certain extent.
more » « less
Full Text Available

Search for: All records